Understanding Image Motion with Group Representations
Authors
Abstract
Motion is an important signal for agents in dynamic environments, but learning to represent motion from unlabeled video is a difficult and underconstrained problem. We propose a model of motion based on elementary group properties of transformations and use it to train a representation of image motion. While most methods of estimating motion are based on pixel-level constraints, we use these group properties to constrain the abstract representation of motion itself. We demonstrate that a deep neural network trained using this method captures motion in both synthetic 2D sequences and real-world sequences of vehicle motion, without requiring any labels. Networks trained to respect these constraints implicitly identify the image characteristic of motion in different sequence types. In the context of vehicle motion, this method extracts information useful for localization, tracking, and odometry. Our results demonstrate that this representation is useful for learning motion in the general setting where explicit labels are difficult to obtain.
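As an illustration of what "elementary group properties of transformations" can mean, the sketch below checks associativity, invertibility, and the identity element for 2D rigid motions. It is not the paper's model or training objective: the abstract does not list which properties are used, so the choice of these three is an assumption, and the `(theta, tx, ty)` parameterization and the function names `compose`, `inverse`, and `distance` are hypothetical.

```python
import math

# A 2D rigid motion is stored as (theta, tx, ty): x -> R(theta) @ x + t.
# This parameterization is an illustrative assumption, not taken from the paper.
IDENTITY = (0.0, 0.0, 0.0)

def compose(a, b):
    """Return the motion 'apply b first, then a'."""
    ta, ax, ay = a
    tb, bx, by = b
    c, s = math.cos(ta), math.sin(ta)
    # R_a (R_b x + t_b) + t_a = R(ta + tb) x + (R_a t_b + t_a)
    return (ta + tb, c * bx - s * by + ax, s * bx + c * by + ay)

def inverse(a):
    """Return the motion that undoes a: x -> R(-theta) @ (x - t)."""
    theta, tx, ty = a
    c, s = math.cos(theta), math.sin(theta)
    return (-theta, -(c * tx + s * ty), -(-s * tx + c * ty))

def distance(a, b):
    """Largest absolute difference between the parameters of two motions."""
    return max(abs(p - q) for p, q in zip(a, b))

f = (0.3, 1.0, -2.0)
g = (-1.1, 0.5, 0.0)
h = (2.0, -3.0, 4.0)

# Each value should be zero up to floating-point rounding.
errors = {
    "associativity": distance(compose(compose(f, g), h), compose(f, compose(g, h))),
    "invertibility": distance(compose(f, inverse(f)), IDENTITY),
    "identity": distance(compose(f, IDENTITY), f),
}
```

A learned motion representation cannot be checked in closed form like this. One plausible reading of the abstract is that deviations of this kind are measured on the network's embeddings of image sequences and used as a training signal in place of labels.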
Similar resources
Local velocity-adapted motion events for spatio-temporal recognition
In this paper, we address the problem of motion recognition using event-based local motion representations. We assume that similar patterns of motion contain similar events with consistent motion across image sequences. Using this assumption, we formulate the problem of motion recognition as a matching of corresponding events in image sequences. To enable the matching, we present and evaluate a...
Kinematics parameter extraction of longitudinal movement of common carotid arterial wall in healthy and atherosclerotic subjects based on consecutive ultrasonic image processing
Introduction: In this study, a non-invasive method based on consecutive ultrasonic image processing is introduced to assess time rate changes of the carotid artery wall displacement, velocity and acceleration in the longitudinal direction. The application of these parameters to discriminate healthy and atherosclerotic arteries was evaluated. Methods: Longitudinal displacement rate of common ...
Mid-level Representation for Visual Recognition
Visual Recognition is one of the fundamental challenges in AI, where the goal is to understand the semantics of visual data. Employing mid-level representation, in particular, shifted the paradigm in visual recognition. The mid-level image/video representation involves discovering and training a set of mid-level visual patterns (e.g., parts and attributes) and represent a given image/video util...
Loosely coupled web representations: a REST service and JavaScript wrapper for sharing web-based visual representations
This paper presents the design and application of a web service architecture for providing shared access to web-based visual representations, such as dynamic models, simulations and visualizations. The Shared Representations (SR) system was created to facilitate the development of collaborative and co-operative learning activities over the web, and has been applied to provide shared group acces...
Tracking by switching state space models
We propose a novel tracking method that can switch between different state representations, e.g., image coordinates in different views or image and ground plane coordinates. During the tracking process, our method adaptively switches between these representations. We demonstrate the applicability of our method for dynamic cameras tracking dynamic objects: Using the image based represen...